Adaptive Feature Selection for Hyperspectral Data Analysis Using a Binary Hierarchical Classifier and Tabu Search

نویسندگان

  • Donna Korycinski
  • Melba Crawford
  • Joydeep Ghosh
چکیده

High dimensional inputs coupled with scarcity of labeled data are among the greatest challenges for classification of hyperspectral data. These problems are exacerbated if the number of classes is large. High dimensional output classes can often be handled effectively by decomposition into multiple two(meta)class problems, where each sub-problem is solved using a suitable binary classifier, and outputs of this collection of classifiers are combined in a suitable manner to obtain the answer to the original multi-class problem. This approach is taken by the binary hierarchical classifier (BHC). The advantages of the BHC for output decomposition can be further exploited for hyperspectral data analysis by integrating a feature selection methodology with the classifier. Building upon the previously developed best bases BHC algorithm with greedy feature selection, a new method is developed that selects a subset of band groups within metaclasses using reactive tabu search. Experimental results obtained from analysis of Hyperion data acquired over the Okavango Delta in Botswana are superior to those of the greedy feature selection approach and more robust than either the original BHC or the BHC with greedy feature selection. INTRODUCTION For classification of hyperspectral data, there are potentially hundreds of correlated inputs which may result in unstable estimates of parameters, particularly when there is a small quantity of training data. This problem is commonly addressed using some form of feature extraction, such as feature subset selection. The computational complexity of optimal feature selection methods has forced acceptance of heuristic techniques that find good, near-optimal subsets in relatively short computational times. A comparative study of several of the well-known optimal and sub-optimal feature selection algorithms [e.g. variations of Sequential Forward/Backward Selection (SFS/SBS), Branch and Bound and relaxed Branch and Bound] is contained in [1]. Other ACKNOWLEDGMENT This work was supported by the National Aeronautics and Space Administration under the EO-1 program (NCC5-463). approaches include genetic algorithms [2], simulated annealing [3], and the Tabu Search (TS) metaheuristic [4]. In this study, a new model is developed which incorporates the use of TS with the multiclassifier system known as the Best Bases Binary Hierarchical Classifier (BB BHC) [5-8] for analysis of hyperspectral data. The primary goal in development of the BHC was output decomposition for problems with a medium to large number of classes. While the classification accuracies obtained from the BHC are typically good, problems are encountered if the input is highdimensional, and the amount of training data is limited.

منابع مشابه

Adaptive Feature Selection for Hyperspectral Data Analysis

Hyperspectral data can potentially provide greatly improved capability for discrimination between many land cover types, but new methods are required to process these data and extract the required information. Data sets are extremely large, and the data are not well distributed across these high dimensional spaces. The increased number and resolution of spectral bands, many of which are highly ...

متن کامل

Random Forests of Binary Hierarchical Classifiers for Analysis of Hyperspectral Data

Statistical classification of hyperspectral data is challenging because the input space is high in dimension and correlated, but labeled information to characterize the class distributions is typically sparse. The resulting classifiers are often unstable and have poor generalization. A new approach that is based on the concept of random forests of classifiers and implemented within a multiclass...

متن کامل

Feature selection using genetic algorithm for classification of schizophrenia using fMRI data

In this paper we propose a new method for classification of subjects into schizophrenia and control groups using functional magnetic resonance imaging (fMRI) data. In the preprocessing step, the number of fMRI time points is reduced using principal component analysis (PCA). Then, independent component analysis (ICA) is used for further data analysis. It estimates independent components (ICs) of...

متن کامل

Adaptive feature spaces for land-cover classification with limited ground truth

Classification of hyperspectral data is challenging because of high dimensionality (O(100)) inputs, several possible output classes with uneven priors, and scarcity of labeled information. In an earlier work, a multiclassifier system arranged as a binary hierarchy was developed to group classes for easier and progressive discrimination [27]. This paper substantially expands the scope of such a ...

متن کامل

A New Approach for Handling Null Values in Web Log Using KNN and Tabu Search KNN

When the data mining procedures deals with the extraction of interesting knowledge from web logs is known as Web usage mining. The result of any mining is successful, only if the dataset under consideration is well preprocessed. One of the important preprocessing steps is handling of null/missing values. Handlings of null values have been a great bit of test for researcher. Various methods are ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003